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0 Synthetic DNA probes. 



0 A synthetic oligonucleotide hybridization probe 
comprising at least one sequence at least 12 
nucleotides long, the probe forms duplexes in which 
at least 80% of the nucleotides are base paired with 
any nucleotide sequence that is either substantially 
of the form (Xt...Xi)n in which X is any nucleotide: i 
which designates the number of nucleotides in the 
sequence is at least 2 and n is at least 2, or 
substantially of the form [{X|.»Xi)mY]w in which X is 
any nucleotide, Y is any group of 1 or more 
nucleotides and is different from P(i...Xi), m is at 
least 1, w is 2, and i is at least 2, the number of 
nucleotides in said probe being less than 31 when i 
is greater than 3. 

00 

o 

CM 

o 

Q. 

Ul 




Europaisches Patentamt 
European Patent Office 
Office europden des brevets 



Xerox Copy Centre 



1 



EP 0 294 098 A1 



2 



SYNTHETIC DNA PROBES 



Reld of the Invention 



This invention relates to synthetic 
oligonucleotides useful to identify intrinsic polymor- 
phism of loci in human genomic DNA. 

Baclcground of the Invention 

The human genome contains several families 
of repetitive DNA sequences. Individual repetitive 
DNA families may be dispersed throughout the 
genome or may be organized In the form of iong 
arrays of simple tandem repeats (STR). Some STR 
sequences have been referred to as minisatellites 
but have not been isolated as sequences of unique 
buoyant density. (Jeffreys, et al (1985) Nature 
314:67; Jeffreys, et al. (1985) Nature 316:76 
(1985); Jeffreys U.K. Patent Application GB 2 166 
445 A published May 8. 1986.) Many of these 
minisatellites seem to be highly polymorphic due 
to variation in the number of copies of tandemly 
repeated units (VTNRs). Recently several loci 
which contain variable numbers of tandem repeats 
have been described. (Nakamura. et al., (1987) 
Science 235:1616-1632.) 

Some human VTNR loci have been isolated as 
cloned DNA's and characterized. (Nakamura, et al.. 
supra , n. 2; Wong, et al. (1986) Nucleic Acids 
Research 14:4605-4615.) Probes consisting of 
many tandem repeat units hybridize to multiple loci 
each having considerable restriction fragment 
length polymorphism (RFLP). The consequent in- 
dividual specific hybridization pattern fonms the ba- 
sis of genetic fingerprinting. 

Summary of the Invention 

This invention comprises synthetic 
oligonucleotide probes complementary to one or 
two repeat units of VTNR loci. Under high hy- 
bridization criteria, these probes hybridize locus 
specifically thus allowing the evaluation of the in- 
trinsic polymorphism of individual loci. 



Detailed Description of the Invention 



As the number of tandem repeat units of 
5 known minisatellite probes increases, specificity for 
DNA sequences of similar repeat unit length in- 
creases. (See, e.g.. the references cited supra, 
note 1; Jeffreys, et al. (1986) Am; i Hum. Genet. 
39:11-24.) 

70 Pursuant to this invention, probe length is limit- 
ed to render hybridization dependent upon the 
degree of matching and the stringency of the hy- 
bridization criteria. As distinguished from the prior 
art. the probes of this invention provide greater 

75 specificity for a single locus or for a small number 
of loci included in a DNA sequence. 

The synthetic oligonucleotide DNA hybridiza- 
tion probes of this invention comprise at least one 
sequence at least 12 nucleotides long which form 

20 duplexes in which at least about 80% of the 
nucleotides are base paired with any nucleotide 
sequence that is substantially of the fonm 

(i) (Xi...Xi)„ in which X is any nucleotide, i 
designates tiie number of nucleotides in the repeat 

25 and is at least 2, and n is at least 2, or substantially 
of the form 

(ii) [p<|..J<i)mY]w in which X is any 
nucleotide, Y is any group of one or more 
nucleotides and is different from (X|...Xi). m is at 

30 least 1 , w is 2 and i is at least 2, tiie number of 
nucleotides in said probe being less than 31 when i 
is greater than 3. 

To accommodate either consensus or mixed 
genomic sequences, the probes of the invention 

35 may include eitiier a single sequence or a mixture 
of sequences each at least 12 nucleotides long. 
The (X|...XO repeats may include minor sequence 
variations, hence the probes substantially conform 
to fonms (i) and (ii). 

40 A preferred embodiment comprises locus spe- 
cific probes which hybridize at high criteria to form 
100% matched duplexes. Interdependent con- 
straints are tinus placed both on the probe and on 
the locus. 

45 Table I reports the sequences of 1 1 syntiietic 
oligonucleotide probes (1, 2. and 4 to 12) which 
embody tills invention, and one prior art probe (3). 
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As Table I shows, under varying hybridization 
and wash criteria, these probes fall into four dif- 
ferent hybridization pattern categories. Category (1) 
probes produce highly polymorphic, muitiloci, pat- 
terns under low hybridization criteria and a locus 
specific polymorphic pattern under high criteria. 
Category (2) probes produce a highly polymorphic, 
muitiloci, pattern under both low as well as high 
criteria, although a few minor bands are lost in 
subsequent washes. Category (3) prior art probes 
produce a polymorphic muitiloci pattem under low 
criteria and allele specific but non-polymorphic pat- 
tern under high criteria. The category (4) probe 
produces a locus specific and polymorphic pattern 
under low as well as high criteria. 

Hybridization criteria are based on temperature 
and cation, typically sodium ion. concentration. Fre- 
quently the temperature is varied whiier the cation 
concentration is constant. 

The following empirical fonnnula has been sug- 
gested for the calculation of Td (the temperature at 
which one-half of the oligonucleotide duplex be- 
comes dissociated in 1 M NaCI) for a given 
oligonucleotide: 

= 4*C(G + C) + 2* (A + T) 
where (G + C) is the number of Q and C bases in 
the oligonucleotide and (A + T) the number of A 
and T bases in the oligonucleotide, (See Suggs, at 
al. "Development Biology Using Purified Genes" 
(D. Brown and C.F. Fox. Eds.) Academic Press. 
New Yori< (1981).) 

However, finding optimal hybridization and 
wash conditions is empirical in nature. Hence the 
formula is but a rule of thumb useful only for most 
oligonucleotides 11 to 23 bases in length. 

For the purposes of this invention for 
oligonucleotide probes not more than 23 
nucleotides in iength. high criteria hybridizations 
may be done at a temperature of from about 0- 
10* C, preferably §bout s'C below Ta. For longer 
oligonucleotide probes, hybridization and washes 
can be done at 65' C. In each case, post-hybridiza- 
tion washes are accomplished at temperatures suf- 
ficiently high to leave only the duplexes where 
80% of the nucleotide are base paired. 

The synthetic oligonucleotide probes listed in 
Table I were synthesized using an automated DNA 
synthesizer and purified by polyacrylamide gel 
electrophoresis and HPLC using a PRP-1 reverse 
phase column (Hamilton). Oligonucleotide (10 
pmole) were labeled at the 5' terminal hydroxyl 
group using y-pp p^jp (20 pmole NEN, 7000Ci/m 
mole) and T4 polynucleotide kinase 5-7 units 
(NEN) in 10 ul reaction volume containing 67 mM 
Tris-HCI at pH 8.0, 10 mM Mg (0Ac)2. and 10 mM 
dithiothreitol. The kinase reactions were carried out 
at 37 *C for 45 min. The labeled oligonucleotides 



were then separated from the [gamma 32P] ATP 
by chromatography over DE 52 Cellulose 
(Whatman) as described elsewhere. ( See Ali, et al. 
(1986) Hum.Genet 74: 239.) Gels containing sam- 

5 pies from unrelated individuals were hybridized to 
each probe at 37* C. After each hybridization, the 
gels were washed in SxSSC at various tempera- 
tures ranging from 37* C to 70* C. After each wash, 
autoradiograms were recorded. Some of the repre- 

10 sentative hybridization results are presented by 
Rgure 1 . 

To obtain the data reflected by Rgures 1A to 
IE. DNA was isolated from blood cells of unrelated 
individual donors according to the methods of Ali, 

15 supra . 3-5 u.g DNA was digested with Hint I (BRL) 
according to the supplier's specification and elec- 
trophoresed in 0.7% agarose gel (Seakem, ME) in 
recirculating TAE buffer (40 mM Tris. 12 mM so- 
dium acetate, 2 mM EDTA. pH 8.3). For size es- 

20 timation. phage lambda DNA predigested with Hind 
111 restriction enzyme was included in the gel elec- 
trophoresis. After the electrophoresis, the gets were 
dried on a gel dryer (Bio-Rad) for 1 hr at room 
temperature followed by another 1 hr at 60* C. 

25 Prior to the hybridization, the gels were processed 
for denaturation and neutralization as described 
elsewhere. (Jeffreys, et al. (1985) Nature 314:67.) 
Gel hybridizations were carried out for 16 to 20 hr 
at 37* C in 5xSSPE [IxSSPE = 180 mM NaCI. 10 

30 mM sodium phosphate, 1 mM EDTA, pH 8.0] 0.1% 
SOS, 10 ug/ml sonicated and denatured E. coli 
DNA and 1 x 10^ cpm/ml labeled oligonucleotide 
probe. After the hybridization, gels were washed in 
6xSSC at room temperature for 3-4 hr, and ex- 

35 posed to X-ray film (Kodak X-AR) with two inten- 
sifying screens (Dupont lightning plus) overnight 
(Figures 1A. C and E). After obtaining the first 
autoradiogram, the gels were washed at increasing 
temperatures including 37. 40. ^, 50, 55. 60, 

40 65* C. and some of the gels at 70* C for 5-10 min 
or longer depending upon the signal intensity and 
after each wash, the autoradiograms were obtained 
(Rgures IB, D and F). 

Only the following representative autoradiog- 

45 rams are shown. 

Rgure 1A: Autoradiograms of the gel 
hybridized with oligonucleotide probe 0-33.6 (Table 
I) and exposed without any stringent wash. 

Rgure 1B: Same gel after a wash at 65*C 

50 for 10 min. Note the locus specific polymorphic 
bands. Ali the other bands are lost during the 
sequential washes at the higher temperature (not 
shown). 

Rgure 1C: Autoradiogram of the gel 
55 hybridized with 0-33.1 probe (Table I). Several 
polymorphic bands are seen ranging from 10.1 to 
l.8kb. 
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Rgure 1D: The same gel after a final wash at 
65* C for 10 min. Note the one or two bands in the 
size range of 2.5 to 1 .6kb. 

Figure 1 E: Autoradlogram of the gel 
hybridized with the 0.33.15 probe (Table I) and 
exposed without any stringent wash. 

Rgure IF: The same gel after the wash at 
65* C for 15 min/ All the bands are eliminated 
except a 1 .3 kb band present in all the individuals. 
In lane 6. there is an additional band of about 
l.45kb. This is the only sample that shows this 
additional band; otherwise, all the individuals are 
alike. 

The first oligonucleotide probe 0-33.6. a 22 
base long probe complementary to two consensus 
repeat units of the Jeffreys 33.6 sequence. (See 
All, supra , note 6.) produces a highly polymorphic 
pattern giving rise to a number of shared and 
unshared bands when the gel is exposed without 
any stringent wash (Rg. 1A). There are four bands 
of 3.8, 2.8, 2.4 and 2.1 kb which appear to be 
common in all the individuals. These non-polymor- 
phic fragments appear together with several poly- 
morphic fragments ranging from 6-18 kb. After 
washing the gel at increasingly higher tempera- 
tures, a locus specific hybridization pattern 
emerges, where all the bands except those In the 
size range of 1.3 to 1.5 kb are eliminated (Rg. IB). 
These bands are obviously polymorphic; however, 
since the repeat unit of this locus is only 11 
nucleotides, the different alleles are not resolved 
well. These bands which are removed by higher 
stringency washes may reflect related sequences 
which are not completely complementary to the 
probe. 

Similariy, the seventh oligonucleotide 0-33.1. a 
27 base long probe complementary to a single 
consensus repeat unit of the 33.1 sequence, pro- 
duces a complex polymorphic pattern under low 
stringency conditions (Rg. 1C) comprises mostly of 
unshared bands in the size range of 9.5 to 1 .9 kb, 
except for one of 10.1 kb which is relatively faint 
and seems to be common amongst ail six in- 
dividuals. After the high stringency wash, all but 
one or two bands are removed. Of the six in- 
dividuals analyzed, four of them have two bands in 
the size range of 1.7 to 2.5 kb, whereas two of 
them have only one band each of about 2.5 kb 
(Fig. ID). Thus, under high stringency conditions, 
this oiigonucteotide probe revealed five alleles in 
the six individuals examined (2.5. 2.3, 2.1. 2.0 and 
1.7 kb). 

In contrast to the results obtained with the two 
probes discussed above, the second probe 0- 
33.15, a 32 base long probe complementary to two 
consensus repeat units of the 33.15 sequence, 
produced a complex DNA fingerprint pattem when 
hybridized and washed at low stringency (Fig. IE) 



but a non-polymorphic locus specified pattern 
when washed at high stringency (Rg. IF), Under 
the high stringency wash condition, a 1.3 kb non- 
polymorphic band is seen in ail individuals (Rg. 
5 IF). 

In addition to this 1.3 kb non-polymeric band, 
there is an additional 1.45 kb band present In one 
individual (Rg. IF. lane 6). It is difficult to account 
for the presence of this additional band. Only the 

10 1.3 kb band was found when the genomic DNA of 
several other unrelated individuals was analyzed 
(not shown). Based on the level of hybridization of 
the probe to the two bands in this individual, it is 
reasonable to assume that this additional band is 

76 not an allele of the 1.3 kb band but rather is due to 
the presence of an additional locus complementary 
to the probe (no family members of this individual 
are available to confirm this suggestion). 

The cloned VTNR sequence 33.6 has been 

20 described as having an 1 1 nucleotide repeat unit or 
alternatively as having a 37 nucleotide repeat unit 
made up of three 1 1 nucleotide repeats plus addi- 
tional nucleotides. An oligonucleotide 0-33.6-37, a 
37 nucleotide long probe 14 complementary to one 

26 consensus 37 nucleotide repeat, was synthesized. 
Genomic DNA from several unrelated individuals 
was digested with Hinf I and electrophoresed on a 
1.5% agarose gel. The DNA in the gel was 
hybridized with the 37 base long probe under strin- 

30 gent hybridization and post-hybridization wash con- 
ditions. As can be seen in Rg. 2A, the 33.6 locus is 
highly polymorphic, with three of the nine in- 
dividuals examined being heterozygous. At least 5 
alleles are resolved in the 13 chromosomes stud- 

35 ies. This probe thus revealed the intrinsic polymor- 
phism of the 33.6 locus which was suggested by 
the results in Fig. lA. Genomic DNA from a family 
was analyzed in the same way (Rg. 2B). The 
results are consistant with a single Mendelian !o- 

40 cus. 

The synthetic oligonucleotide probes of this 
invention are of particular utility for genetic iden- 
tification purposes, patemity and matemity testing, 
diagnosis of genetic diseases and cancer, and in 
45 the study of post bone man'ow transplant 
chimerism. See. e.g.. Yam, et al., Am.J.Hum.Genet . 
41:867-881 (1987) and parent application Serial No. 
055.224. 

50 

Claims 

1. A synthetic oligonucleotide hybridization 
probe comprising at least one sequence at least 1 2 
55 nucleotides long, said probe forms duplexes in 
which at least 80% of the nucleotides are base 
paired with any nucleotide sequence that is either 
substantially of the form 
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(al {X|..J(|)n In which X is any nucleotide; i 
which designates the number of nucleotides in the 
sequence is at least 2 and n is at least 2, or 
substantially of the form 

(b) ((X|...Xi)mYlw in which X is any 
nucleotide, Y is any group of 1 or more nucleotides 
and Is different from (X,...XO. m is at least 1 . w Is 2, 
and i is at least 2. the number of nucleotides In 
said probe being less than 3i when i is greater than 
3. 

2. A synthetic DNA hybridization probe as de- 
fined by claim which comprises a mixture of se- 
quences each at least 12 nucleotides long. 

3. A synthetic DNA hybridization probe as de- 
fined by claim which comprises a single sequence 
at least 12 nucleotides long. 

4. A locus specific synthetic DNA probe which 
hybridizes at high criteria to form 100% matched 
duplexes with a nucleotide sequence as described 
by claim 1. 

5. A probe as defined by claim 4 which Is not 
more than 23 nucleotides in length and which 
forms at least 80% matched duplexes with any 
sequence as defined in subparagraphs (a) and (b) 
of claim 1 when hybridized at a temperature of 
from about 5*C to about 10* C below Td followed 
by post-hybridization washes at temperatures suffi- 
ciently high to leave only the duplexes where at 
least 80% of the nucleotides are base paired. 

6. A probe as defined by claim 4 which is more 
than 23 nucleotides in length and which fonms at 
least 80% matched duplexes with any sequences 
as defined in subparagraphs (a) and (b) of claim 1 
when the hybridization is accomplished at 65 and 
the post-hybridization washes are accomplished at 
temperatures sufficiently high to leave only du- 
plexes where at least 80% of the nucleotides are 
base paired. 

7. A synthetic oligonucleotide which consists of 
or comprises any one cr more of the sequences: 

(a) 5' (CCTCCAGCCCT) 3' 

(b) 5' (CACCTCTCCACCTGCC) 3' 

(c) 6 QAGQARYAGAAAGGYQR- 
GYRVTGTGQGCQ 3' in which R is A or G, or A 
and Q. Y is C or T. or C and T. and V is not T 

(d) 5' ^ - 
(AGQARYAGAAAGGYGRGYRVTGTQGQC 3 in 
which R Is A or G 

(e) 5' TCCTQAACAACCCCACTGTACTTC- 

CCA3 

(f) , 5 
GTGCCTGCTTGCCTTCCCTCTCTTGTC 3 

{g) 5' CCTGCTCCGCTCACQTGGGCCACG- 

CAC 3 

(h) 5' (CCR)8CC 3' in which R is A or G 

(i) 5' (CCR)8CC 3 

Q) 5' GCCCTTCCTCCGGAGCCGTCCTC- 
CAGCCCTTCCTCCA 3' 



(k) 5' AGGARYAGAAAGGYGR- 

GYRVTGTGGGC 3' 

(I) 5' CACTCCCCCTTCTCTCCAGGG- 
GACGCC 3' 

5 8. A method of genetically identifying a sample 
of human genomic DNA which comprises hybridiz- 
ing said DNA with a synthetic oligonucleotide as 
defined in claim 1 . 

9. A method of genetically identifying a sample 
70 of human genomic DNA which comprises hybridiz- 
ing said DNA with a synthetic oligonucleotide as 
defined in claim 2. 

10. A method of genetically identifying a sam- 
ple of human genomic DNA which comprises 

75 hybridizing said DNA with a synthetic 
oligonucleotide as defined in claim 3. 

11. A method of genetically identifying a sam- 
ple of human genomic DNA which comprises 
hybridizing said DNA with a synthetic 

20 oligonucleotide as defined in claim 4. 

12. A method of genetically identifying a sam- 
ple of human genomic DNA which comprises 
hybridizing said DNA with a synthetic 
oligonucleotide as defined in claim 7. 

25 13. A synthetic oligonucleotide probe as de- 
fined in claim 1 which, under high criteria yields a 
locus specific, polymorphic hybridization pattern. 

14. A synthetic oligonucleotide as defined by 
claim 1 which, at high criteria, yields a multiloci 

30 polymorphic hybridization pattern. 

15. A synthetic oligonucleotide probe as de- 
fined in claim 1 which forms said at least 80% 
matched duplexes at high criteria. 

16. A probe as defined in claim 15 not more 
35 than 23 nucleotides in length which forms said at 

least 80% matched duplexes when hybridization is 
accomplished at a temperature of from about 5*0 
to about 10* C below Td and post-hybridization 
washes are accomplished at temperatures suffi- 
40 ciently high to leave only the duplexes where at 
least about 80% of the nucleotides are base paired. 
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